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1.  Introduction 

Plan  recognition  is  the  task  of  predicting  an  agent’s  top-level  plans  based  on  its  ob¬ 
served  actions.  It  is  an  abductive  reasoning  task  that  involves  inferring  cause  from 
effect  (Chamiak  and  McDermott,  1985).  Early  approaches  to  plan  recognition 
were  based  on  first-order  logic  in  which  a  knowledge-base  of  plans  and  actions 
is  developed  for  the  domain  and  then  default  reasoning  (Kautz  and  Allen,  1986; 
Kautz,  1987)  or  logical  abduction  (Ng  and  Mooney,  1992)  is  used  to  predict  the 
best  plan  based  on  the  observed  actions.  Kautz  and  Allen  (1986);  Kautz  (1987) 
developed  one  of  the  first  logical  formalizations  of  plan  recognition.  They  used 
non-monotonic  deductive  inference  to  predict  plans  using  observed  actions,  an  ac¬ 
tion  taxonomy,  and  a  set  of  commonsense  rules  or  constraints.  Lesh  and  Etzioni 
(1995)’s  approach  to  goal  recognition  constructs  a  graph  of  goals,  actions,  and 
their  schemas  and  prunes  the  network  until  the  plans  present  in  the  network  are 
consistent  with  the  observed  goals.  The  approach  by  Hong  (2001)  also  constructs 
a  “goal  graph”  and  analyses  the  graph  to  identify  goals  consistent  with  observed 
actions.  However,  these  approaches  are  unable  to  handle  uncertainty  in  the  obser¬ 
vations  or  background  knowledge  and  are  incapable  of  estimating  the  likelihood 
of  different  plans. 

Another  approach  to  plan  recognition  is  to  directly  use  probabilistic  methods.  Al¬ 
brecht  et  al.  (1998)  developed  an  approach  based  on  dynamic  Bayesian  networks 
to  predict  plans  in  an  adventure  game.  Horvitz  and  Pack  (1999)  developed  an 
approach  that  uses  Bayesian  networks  to  recognize  goals  in  an  automated  conver¬ 
sation  system.  Pynadath  and  Wellman  (2000)  extended  probabilistic  context-free 
grammars  to  plan  recognition.  Kaminka  et  al.  (2002)  developed  an  approach  to 
multiagent  plan  recognition  using  dynamic  Bayesian  networks  to  perform  moni¬ 
toring  in  distributed  systems.  Bui  et  al.  (2002);  Bui  (2003)  used  Abstract  Hidden 
Markov  Models  for  hierarchical  goal  recognition.  Saria  and  Mahadevan  (2004) 
extended  the  work  by  Bui  (2003)  to  multiagent  plan  recognition.  Blaylock  and 
Allen  (2005)  used  statistical  n-gram  models  for  the  task  of  instantiated  goal  recog¬ 
nition.  While  these  approaches  can  handle  uncertainty  and  can  be  trained  effec- 
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lively,  they  ean  not  handle  the  kind  of  struetured  relational  data  that  ean  be  rep¬ 
resented  in  first-order  predieate  logie.  Furthermore,  it  is  diffieult  to  ineorporate 
planning  domain  knowledge  in  these  approaehes. 

The  third  eategory  of  approaehes  use  aspeets  of  both  logieal  as  well  as  proba- 
bilistie  reasoning.  Hobbs  et  al.  (1988)  attaeh  weights  or  eosts  to  predieates  in  the 
knowledge  base  and  use  these  weights  to  guide  the  seareh  for  the  best  explana¬ 
tion.  Goldman  et  al.  (1999)  use  the  probabilistie  Horn  abduetion  (Poole,  1993) 
framework  to  find  the  best  set  of  plans  that  explain  the  observed  aetions.  Several 
other  approaches  use  Bayesian  networks  (Charniak  and  Goldman,  1989,  1991; 
Huber  et  ah,  1994)  to  perform  abductive  inference.  Based  on  the  observed  actions 
and  a  knowledge  base  constructed  for  planning,  Bayesian  networks  are  automati¬ 
cally  constructed  using  knowledge  base  model  construction  (KBMC)  procedures. 
However,  most  of  these  approaches  do  not  have  the  capabilities  for  learning  the 
structure  or  the  parameters.  Another  chapter  by  Inoue  et  al.  (2013)  appearing  in 
the  current  volume  explores  the  use  of  weighted  abduction  for  large  scale  dis¬ 
course  processing. 

The  last  decade  has  seen  a  rapid  growth  in  the  area  of  Statistical  Relational 
Learning  (SRL)  (Getoor  and  Taskar,  2007),  which  uses  well-founded  probabilistic 
methods  while  maintaining  the  representational  power  of  first-order  logic.  Since 
these  models  combine  the  strengths  of  both  first-order  logic  and  probabilistic 
graphical  models,  we  believe  that  they  are  well  suited  for  solving  problems  like 
plan  recognition.  In  this  paper,  we  explore  the  efficacy  of  different  SRL  mod¬ 
els  for  the  task  of  plan  recognition.  We  focus  on  extending  two  specific  SRL 
models,  Markov  Logic  Networks  (MLNs)  (Domingos  and  Lowd,  2009)  (based  on 
undirected  probabilistic  graphical  models)  and  Bayesian  Logic  Programs  (BLPs) 
(Kersting  and  De  Raedt,  2001)  (based  on  directed  probabilistic  graphical  models) 
to  the  task  of  plan  recognition. 

MLNs  attach  real-valued  weights  to  formulas  in  first-order  logic  in  order  to  rep¬ 
resent  their  certainty.  They  effectively  use  logic  as  a  compact  template  for  repre¬ 
senting  large,  complex  ground  Markov  networks.  Since  MLNs  have  been  shown 
to  formally  subsume  many  other  SRL  models  and  have  been  successfully  applied 
to  many  problems  (Domingos  and  Lowd,  2009),  we  chose  to  explore  their  appli¬ 
cation  to  plan  recognition.  However,  the  representational  power  and  flexibility 
offered  by  MLNs  come  at  a  cost  in  computational  complexity.  In  particular,  many 
problems  result  in  exponentially  large  ground  Markov  networks,  making  learning 
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and  inference  intractable  in  the  general  case. 

Pearl  (1988)  argued  that  causal  relationships  and  abductive  reasoning  from  ef¬ 
fect  to  cause  are  best  captured  using  directed  graphical  models  (Bayesian  net¬ 
works).  Since  plan  recognition  is  abductive  in  nature,  this  suggested  that  we  also 
explore  a  formalism  based  on  directed  models.  Therefore,  we  also  explored  the 
application  of  BLPs,  which  combine  first-order  Horn  logic  and  directed  graphical 
models,  to  plan  recognition.  BLPs  use  SLD  resolution  to  generate  proof  trees, 
which  are  then  used  to  construct  a  ground  Bayesian  network  for  a  given  query. 
This  approach  to  network  construction  is  called  knowledge  base  model  construc¬ 
tion  (KBMC).  Similar  to  BLPs,  prior  approaches  (Wellman  et  ah,  1992;  Ngo  and 
Haddawy,  1997)  also  employ  the  KBMC  technique  to  construct  ground  Bayesian 
networks  for  inference.  Another  approach  called  probabilistic  Horn  abduction 
(PHA)  (Poole,  1993)  performs  adductive  reasoning  using  first-order  knowledge 
bases  and  Bayesian  networks.  However,  since  the  BLP  framework  imposes  fewer 
constraints  on  representation,  both  with  respect  to  structure  as  well  as  the  parame¬ 
ters  (Kersting  and  De  Raedt,  2007)  and  since  it  provides  an  integrated  framework 
for  both  learning  and  inference,  we  decided  to  use  BLPs  as  opposed  to  PHA  or 
other  similar  formalisms. 

Logical  approaches  to  plan  recognition,  e.g.  (Kautz  and  Allen,  1986;  Ng  and 
Mooney,  1992),  typically  assume  a  knowledge  base  of  plans  and/or  actions  ap¬ 
propriate  for  planning,  but  not  specifically  designed  for  plan  recognition.  The 
advantage  of  this  approach  is  that  a  single  knowledge  base  is  sufficient  for  both 
automated  planning  and  plan  recognition.  Also,  knowledge  of  plans  and  actions 
sufficient  for  planning  is  usually  easier  to  develop  than  a  knowledge  base  espe¬ 
cially  designed  for  plan  recognition,  which  requires  specific  rules  of  the  form  “If 
an  agent  performs  action  A,  they  may  be  executing  plan  P.”  A  recent  MLN-based 
approach  to  plan/activity  recognition  (Sadilek  and  Kautz,  2010b,a),  requires  such 
manually -provided  plan-recognition  rules. 

Our  goal  is  to  develop  general-purpose  SRL-based  plan-recognition  systems  that 
only  require  the  developer  to  provide  a  knowledge-base  of  actions  and  plans  suf¬ 
ficient  for  planning,  without  the  need  to  engineer  a  knowledge-base  specifically 
designed  for  plan  recognition.  Plan  recognition  using  only  planning  knowledge 
generally  requires  abductive  logical  inference.  BLPs  use  purely  deductive  logical 
inference  as  a  pre-processing  step  to  the  full  blow-blown  probabilistic  inference. 
Encoding  the  planning  knowledge  directly  in  MLNs  does  not  support  abductive 
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reasoning  either.  Further,  using  the  standard  semanties  of  MLNs  of  grounding  the 
whole  theory  leads  to  a  blow-up  for  plan  reeognition  problems.  Consequently,  nei¬ 
ther  BLPs  or  MLNs  ean  be  direetly  used  in  their  eurrent  form  for  abduetive  plan 
reeognition.  Therefore,  this  paper  deseribes  re-eneoding  strategies  (for  MLNs)  as 
well  as  enhaneements  to  both  models  that  allow  them  to  utilize  logieal  abduetion. 
Our  other  goal  involves  developing  systems  that  are  eapable  of  learning  the  nee- 
essary  parameters  automatieally  from  data.  Sinee  both  BLPs  and  MLNs  provide 
algorithms  for  learning  both  the  strueture  and  the  parameters,  we  adapt  them  in 
our  work  to  develop  trainable  systems  for  plan  reeognition. 

The  main  eontributions  of  the  paper  are  as  follows: 


•  Adapt  SRL  models  like  BLPs  and  MLNs  to  plan  reeognition 

•  Introduee  Bayesian  Abduetive  Logie  Programs  (BALPs),  an  adaptation  of 
BLPs  that  utilizes  logieal  abduetion 

•  Propose  re-eneoding  strategies  for  faeilitating  abduetive  reasoning  in  MLNs 

•  Introduee  abduetive  Markov  logie,  an  adaptation  of  MLNs  whieh  eom- 
bines  re-eneoding  strategies  with  logieal  abduetion  to  eonstruet  the  ground 
Markov  network 

•  Experimentally  evaluate  the  relative  performanee  of  BALPs,  abduetive  MLNs 
(i.e.  using  re-eneoding  strategies  and  abduetive  model  eonstruetion),  tradi¬ 
tional  MLNs,  and  existing  plan-reeognition  methods  on  three  plan-reeognition 
benehmarks. 


The  rest  of  the  paper  is  organized  as  follows.  First,  we  provide  some  baekground 
on  logieal  abduetion,  BLPs,  and  MLNs.  Next,  we  present  our  extensions  to  both 
BLPs  and  MLNs  to  inelude  logieal  abduetion.  Finally,  we  present  an  extensive 
evaluation  of  our  approaehes  on  three  benehmark  datasets  for  plan  reeognition, 
eomparing  them  with  the  existing  state-of-the-art  for  plan  reeognition. 
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2.  Background 


2.1.  Logical  Abduction 

In  a  logical  framework,  abduction  is  usually  defined  as  follows  (Pople,  1973; 
Levesque,  1989;  Kakas  et  ak,  1993): 


•  Given:  Background  knowledge  B  and  observations  O,  both  represented  as 
sets  of  formulae  in  first-order  logic,  where  B  is  typically  restricted  to  Horn 
clauses  and  O  to  ground  literals. 

•  Find:  A  hypothesis  H,  also  a  set  of  logical  formulae  (typically  ground 
literals),  such  that  B  U  H  ^  1.  and  B  U  H  \=  O. 


Here,  |=  represents  logical  entailment  and  ±  represents  false,  i.e.  find  a  set  of 
assumptions  that  is  consistent  with  the  background  theory  and  explains  the  ob¬ 
servations.  There  are  generally  many  hypotheses  H  that  explain  a  given  set  of 
observations  O.  Following  Occam’s  Razor,  the  best  hypothesis  is  typically  de¬ 
fined  as  the  one  that  minimizes  the  number  of  assumptions,  \H\.  Given  a  set  of 
observations  Oi,  O2,  ....,  On,  the  set  of  abductive  proof  trees  is  computed  by  re¬ 
cursively  backchaining  on  each  Oi  until  every  literal  in  the  proof  is  either  proven 
or  assumed.  Logical  abduction  has  been  applied  to  tasks  such  as  plan  recognition 
and  diagnosis  (Ng  and  Mooney,  1992;  Peng  and  Reggia,  1990). 


2.2.  Bayesian  Logic  Programs 

Bayesian  logic  programs  (BLPs)  (Kersting  and  De  Raedt,  2001)  can  be  viewed 
as  templates  for  constructing  directed  graphical  models  (Bayes  nets).  Given  a 
knowledge  base  as  a  special  kind  of  logic  program,  standard  backward-chaining 
logical  deduction  (SLD  resolution)  is  used  to  automatically  construct  a  Bayes  net 
on  the  same  lines  as  knowledge  based  model  construction  (KBMC)  (Wellman 
et  ah,  1992;  Breese  et  ah,  1994).  More  specifically,  given  a  set  of  facts  and  a  query, 
all  possible  Hom-clause  proofs  of  the  query  are  constructed  and  used  to  build  a 
Bayes  net  for  answering  the  query.  Standard  probabilistic  inference  techniques 
are  then  used  to  compute  the  most  probable  answer. 
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More  formally,  a  BLP  consists  of  a  set  of  Bayesian  clauses,  definite  clauses  of  the 

form  A\Ai,  A2,  A3, . An,  where  n  >  0  and  A,  Ai,  A2,  A3, . ,An  are  Bayesian 

predicates  (defined  below).  A  is  called  the  head  of  the  clause  (head{c))  and 
{Ai,  A2,  ^3,....,^^)  is  the  body  {hody{c)).  When  n  =  0,  a  Bayesian  clause  is 
a  fact.  Each  Bayesian  clause  c  is  assumed  to  be  universally  quantified  and  range 
restricted,  i.e  variables{head}  C  variahles{hody},  and  has  an  associated  condi¬ 
tional  probability  distribution:  cpd{c)  =  P{head{c)\body{c)). 

A  Bayesian  predicate  is  a  predicate  with  a  finite  domain,  and  each  ground  atom  for 
a  Bayesian  predicate  represents  a  random  variable.  Associated  with  each  Bayesian 
predicate  is  a  combining  rule  such  as  noisy-or  or  noisy-and  that  maps  a  finite  set 
of  cpd%  into  a  single  cpd  (Pearl,  1988).  Let  A  be  a  Bayesian  predicate  defined  by 

two  Bayesian  clauses,  A\Ai,  A2,  A3, . An  and  A\Bi,  B2,  B3, . Bn,  where  cpdi 

and  cpd2  are  their  cpd’s.  Let  6*  be  a  substitution  that  satisfies  both  clauses.  Then,  in 
the  constructed  Bayes  net,  directed  edges  are  added  from  the  nodes  for  each  A^d 
and  BiO  to  the  node  for  A9.  The  combining  rule  for  A  is  used  to  construct  a  single 
cpd  for  A9  from  cpdi  and  cpd2.  The  probability  of  a  joint  assignment  of  truth 
values  to  the  final  set  of  ground  propositions  is  then  defined  in  the  standard  way 
for  a  Bayes  net:  P(X)  =  ■  P(XilPa(Xi)),  where  X  =  Xi,  X2,  ...,Xn  represents 

the  set  of  random  variables  in  the  network  and  Pa{Xi)  represents  the  parents  of 

X,. 

Once  a  ground  network  is  constructed,  standard  probabilistic  inference  methods 
can  be  used  to  answer  various  types  of  queries  (Roller  and  Lriedman,  2009).  Typ¬ 
ically,  we  would  like  to  compute  the  most  probable  explanation  (MPE),  which 
finds  the  joint  assignment  of  values  to  unobserved  nodes  in  the  network  that  max¬ 
imizes  the  posterior  probability  given  the  values  of  a  set  of  observed  nodes.  This 
type  of  inference  is  also  called  also  known  as  the  maximum  a  posteriori  (MAP) 
assignment  and  might  be  used  interchangeably  in  this  book.  We  would  also  like 
to  compute  the  marginal  probabilities  for  the  unobserved  nodes  given  the  values 
of  observed  nodes.  The  combining-rule  parameters  and  cpd  entries  for  a  BLP  can 
be  learned  automatically  from  data  using  techniques  proposed  by  Kersting  and  De 
Raedt  (2008). 
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2.3.  Markov  Logic  Networks 


Markov  logic  (Richardson  and  Domingos,  2006;  Domingos  and  Lowd,  2009)  is  a 
framework  for  combining  first-order  logie  and  undireeted  probabilistie  graphieal 
models  (Markov  networks).  A  traditional  first-order  knowledge  base  ean  be  seen 
as  a  set  of  hard  constraints  on  the  set  of  possible  worlds:  if  a  world  violates  even 
one  formula,  its  probability  is  zero.  In  order  to  soften  these  constraints,  Markov 
logic  attaches  a  weight  to  each  formula  in  the  knowledge  base.  A  formula’s  weight 
reflects  how  strong  a  constraint  it  imposes  on  the  set  of  possible  worlds.  Formally, 
an  MLN  is  a  set  of  pairs  {Fi,Wi),  where  Fi  is  a  first-order  formula  and  Wi  is  a 
real  number.  A  hard  clause  has  an  infinite  weight  and  aets  as  a  logical  constraint; 
otherwise,  it  is  a  soft  clause.  Given  a  set  of  constants,  an  MLN  defines  a  ground 
Markov  network  with  a  node  in  the  network  for  eaeh  ground  atom  and  a  feature  for 
eaeh  ground  elause.  The  joint  probability  distribution  over  a  set  of  boolean  vari¬ 
ables  X  =  {Xi,X2...)  corresponding  to  the  nodes  in  the  ground  Markov  network 
(i.e.  ground  atoms)  is  defined  as: 


(1) 


where  nj(x)  is  the  number  of  true  groundings  of  F^  in  x  and  Z  is  a  normalization 
term  obtained  by  summing  P{X  =  x)  over  all  values  of  X.  Therefore,  a  possible 
world  becomes  exponentially  less  likely  as  the  total  weight  of  the  ground  elauses 
it  violates  inereases. 

An  MLN  can  be  viewed  as  a  set  of  templates  for  eonstrueting  ground  Markov 
networks.  Different  sets  of  eonstants  produee  different  Markov  networks;  how¬ 
ever,  there  are  eertain  regularities  in  their  strueture  and  parameters  determined  by 
the  underlying  first-order  theory.  Like  in  BLPs,  once  the  ground  network  is  con¬ 
structed,  standard  probabilistie  inferenee  methods  (MPE  or  marginal  inferenee) 
ean  be  used  to  answer  queries.  MLN  weights  ean  be  learned  by  maximizing  the 
eonditional  log-likelihood  of  training  data  supplied  in  the  form  of  a  database  of 
true  ground  literals.  A  number  of  effleient  inference  and  learning  algorithms  that 
exploit  the  structure  of  the  network  have  also  been  proposed.  Domingos  and  Lowd 
(2009)  provide  details  on  these  and  many  other  aspeets  of  MLNs. 
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3.  Adapting  Bayesian  Logic  Programs 


Bayesian  Abductive  Logic  Programs  (BALPs)  are  an  adaptation  of  BLPs.  In  plan 
recognition,  the  known  facts  are  insufficient  to  support  the  derivation  of  deductive 
proof  trees  for  the  requisite  queries.  By  using  aZ^duction,  missing  literals  can  be 
assumed  in  order  to  complete  the  proof  trees  needed  to  determine  the  structure 
of  the  ground  network.  We  first  describe  the  abductive  inference  procedure  used 
in  BALPs.  Next  we  describe  how  probabilistic  parameters  are  specified  and  how 
probabilistic  inference  is  performed.  Finally,  we  discuss  how  parameters  can  be 
automatically  learned  from  data. 


3.1.  Logical  Abduction 

Let  Oi,  O2,  ....,  On  be  the  set  of  observations.  We  derive  a  set  of  most-specific 
abductive  proof  trees  for  these  observations  using  the  method  originally  proposed 
by  Stickel  (1988).  The  abductive  proofs  for  each  observation  literal  are  computed 
by  backchaining  on  each  Oi  until  every  literal  in  the  proof  is  proven  or  assumed. 
A  literal  is  said  to  be  proven  if  it  unifies  with  some  fact  or  the  head  of  some 
rule  in  the  knowledge  base,  otherwise  it  is  said  to  be  assumed.  Since  multiple 
plans/actions  could  generate  the  same  observation,  an  observation  literal  could 
unify  with  the  head  of  multiple  rules  in  the  knowledge  base.  For  such  a  literal,  we 
compute  alternative  abductive  proofs  for  each  such  rule.  The  resulting  abductive 
proof  trees  are  then  used  to  build  the  structure  of  the  Bayes  net  using  the  standard 
approach  for  BLPs. 

The  basic  algorithm  to  construct  abductive  proofs  is  given  in  Algorithm  1 .  The 
algorithm  takes  as  input  a  knowledge  base  (KB)  in  the  form  of  Horn  clauses  and 
a  set  of  observations  as  ground  facts.  It  outputs  a  set  of  abductive  proof  trees 
by  performing  logical  abduction  on  the  observations.  These  proof  trees  are  then 
used  to  construct  the  Bayesian  network.  For  each  observation  Oi,  Abduction- 
BALP  searches  for  rules  whose  consequents  unify  with  Oi.  For  each  such  rule,  it 
computes  the  substitution  from  the  unification  process  and  substitutes  variables  in 
the  body  of  the  rule  with  bindings  from  the  substitution.  The  literals  in  the  body 
now  become  new  subgoals  in  the  inference  process.  If  these  new  subgoals  can¬ 
not  be  proved,  i.e.  if  they  cannot  unify  with  existing  facts  or  with  the  consequent 
of  any  rule  in  the  KB,  then  they  are  assumed.  In  order  to  minimize  the  number 
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Algorithm  1  AbductionBALP 

Inputs:  Background  knowledge  KB  and  observations  Oi,  O2,  O3,  On  both 
represented  as  sets  of  formulae  in  first-order  logie,  where  KB  is  typieally 
restrieted  to  a  set  of  Horn  elauses  and  eaeh  Oi  is  a  ground  literal. 

Output:  Abduetive  proofs  for  all  Oi. 

1:  Let  Q  be  a  queue  of  unproven  atoms,  initialized  with  Oi 
2:  while  Q  not  empty  do 
3;  Aj  <(—  Remove  atom  from  Q 
4:  for  eaeh  rule  Ri  in  KB  do 

5:  consequent  Head  literal  of  Ri 

6:  if  Ai  unifies  with  consequent  then 

7:  Si  ■(—  unify  Aj  and  consequent  and  return  substitution 

8:  Replaee  variables  in  the  body  of  Ri  with  bindings  in  S'j.  Eaeh  literal 

in  the  body  of  Ri  is  a  new  subgoal. 

9:  for  eaeh  liter aU  in  body  of  Ri  do 

10:  if  literali  unifies  with  head  of  some  rule  Rj  in  KB  then 

11:  add  literali  to  Q 

12:  else  if  literali  unifies  with  an  existing  faet  then 

13:  Unify  and  eonsider  the  literal  to  be  proved 

14:  else 

15:  if  literali  unifies  with  an  existing  assumption  then 

16:  Unify  and  use  the  assumption 

17:  else 

18:  Assume  literali  by  replaeing  any  unbound  variables  that  are 

existentially  quantified  in  literali  with  new  Skolem  eonstants. 

19:  end  if 

20:  end  if 

21:  end  for 

22:  end  if 

23:  end  for 

24:  end  while 
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(a)  Partial  knowledge  base  with  two  rules  for  road  block: 

1.  blk_rd(Loc)|hvy_snow(Loc), drive_hzrd(Loc). 

2.  blk_rd(Loc)|acdnt(Loc),  clr_wrck(Crew,  Loc). 

(b)  Observations: 
blk_rd(plaza) 

(c)  Ground  Abductive  Clauses: 

blk_rd(plaza)  |hvy_snow(plaza) ,  drive  Jizrd(plaza) . 
blk_rd(plaza)  |  acdnt  (plaza) ,  clr_wr ck(al ,  plaza) . 


Figure  1 :  (a)  A  partial  knowledge  base  from  the  emergency  response  domain  in  the  Monroe  data 
set.  All  variables  start  with  uppercase  and  constants  with  lowercase,  (b)  The  logical  representation 
of  the  observations,  (c)  The  set  of  ground  rules  obtained  from  logical  abduction. 

of  assumptions,  the  assumed  literals  are  first  matched  with  existing  assumptions. 
If  no  such  assumption  exists,  then  any  unbound  variables  in  the  literal  that  are 
existentially  quantified  are  replaced  by  Skolem  constants. 

In  SLD  resolution,  which  is  used  in  BLPs,  if  any  subgoal  literal  cannot  be  proven, 
the  proof  fails.  However,  in  BALPs,  we  assume  such  literals  and  allow  proofs  to 
proceed  till  completion.  Note  that  there  could  be  multiple  existing  assumptions 
that  could  unify  with  subgoals  in  Step  15.  However,  if  we  used  all  ground  assump¬ 
tions  that  could  unify  with  a  literal,  then  the  size  of  the  ground  network  would 
grow  exponentially,  making  probabilistic  inference  intractable.  In  order  to  limit 
the  size  of  the  ground  network,  we  unify  subgoals  with  assumptions  in  a  greedy 
manner,  i.e.  when  multiple  assumptions  match  with  a  subgoal,  we  randomly  pick 
one  of  them  and  do  not  pursue  the  others.  We  found  that  this  approach  worked  well 
for  our  plan-recognition  benchmarks.  For  other  tasks,  domain-specific  heuristics 
could  potentially  be  used  to  reduce  the  size  of  the  network. 

We  now  illustrate  the  abductive  inference  process  with  a  simple  example  mo¬ 
tivated  by  one  of  our  evaluation  benchmarks,  the  emergency  response  domain 
introduced  by  Blaylock  and  Allen  (2005)  in  the  Monroe  data  set  described  in  Sec¬ 
tion  5.1.  Consider  the  partial  knowledge  base  and  set  of  observations  given  in  Fig- 
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Figure  2:  Bayesian  network  constructed  for  example  in  Figure  1.  The  nodes  with  thick  borders 
represent  observed  actions,  the  nodes  with  dotted  borders  represent  intermediate  nodes  used  to 
combine  the  conjuncts  in  the  body  of  a  clause,  and  the  nodes  with  thin  borders  represent  plan 
literals. 

ure  la  and  Figure  lb  respectively.  The  knowledge  base  consists  of  rules  that  give 
two  explanations  for  a  road  being  blocked  at  a  location:  1)  there  has  been  heavy 
snow  resulting  in  hazardous  driving  conditions,  and  2)  there  has  been  an  accident 
and  the  crew  is  clearing  the  wreck.  Given  the  observation  that  a  road  is  blocked, 
the  plan  recognition  task  involves  abductively  inferring  one  of  these  causes  as  the 
explanation.  For  each  observation  literal  in  Figure  lb,  we  recursively  backchain  to 
generate  abductive  proof  trees.  In  the  given  example,  we  observe  that  the  road  is 
blocked  at  the  location  plaza.  When  we  backchain  on  the  literal  blk_rd(plaza) 
using  Rule  1,  we  obtain  the  subgoals  hvy_snow(plaza)  and  drive_hzrd(plaza). 
These  subgoals  become  assumptions  since  no  observations  or  heads  of  clauses 
unify  with  them.  We  then  backchain  on  the  literal  blk_rd(plaza)  using  Rule  2  to 


12 


obtain  subgoals  acdnt(plaza)  and  clr_wrk(Crew,  plaza).  Here  again,  we  find 
that  these  subgoals  have  to  be  assumed  since  there  are  no  facts  or  heads  of  clauses 
that  unify  with  them.  We  further  notice  that  clr_wrk(Crew,  plaza)  is  not  a  fully 
ground  instance.  Since  Crew  is  an  existentially  quantified  variable,  we  replace  it 
with  a  Skolem  constant  al  to  get  the  ground  assumption  clr_wrk(al,  plaza). 

Figure  Ic  gives  the  final  set  of  ground  rules  generated  by  abductive  inference. 
After  generating  all  abductive  proofs  for  all  observation  literals,  we  construct 
a  Bayesian  network.  Figure  2  shows  the  Bayesian  network  constructed  for  the 
example  in  Figure  1.  Note  that  since  there  are  no  observations/facts  that  unify 
with  the  subgoals  (hvy_snow(plaza),  drive_hzrd(plaza),  acdnt(plaza),  and 
clr_wrk(Crew,  plaza))  generated  during  backchaining  on  observations,  SLD  res¬ 
olution  will  fail  to  generate  proofs.  This  is  typical  in  plan  recognition,  and  as  a 
result,  we  cannot  use  BLPs  for  such  tasks. 


3.2.  Probabilistic  Modeling,  Inference,  and  Learning 

The  only  difference  between  BALPs  and  BLPs  lies  in  the  logical  inference  pro¬ 
cedure  used  to  construct  proofs.  Once  the  abductive  proofs  are  generated,  BALPs 
use  the  same  procedure  as  BLPs  to  construct  the  Bayesian  network.  Further,  tech¬ 
niques  developed  for  BLPs  for  learning  parameters  can  also  be  used  for  BALPs. 

We  now  discuss  how  parameters  are  specified  in  BALPs.  We  use  noisy/logical- 
and  and  noisy-or  models  to  specify  the  cpd%  in  the  ground  Bayesian  network  as 
these  models  compactly  encode  the  cpd  with  fewer  parameters,  i.e.  just  one  pa¬ 
rameter  for  each  parent  node.  Depending  on  the  domain,  we  use  either  a  strict 
logical-and  or  a  softer  noisy-and  model  to  specify  the  cpd  for  combining  evidence 
from  the  conjuncts  in  the  body  of  a  clause.  We  use  a  noisy-or  model  to  specify 
the  cpd  for  combining  the  disjunctive  contributions  from  different  ground  clauses 
with  the  same  head.  Figure  2  shows  the  logical-and  and  noisy-or  nodes  in  the 
Bayesian  network  constructed  for  the  example  in  Figure  1.  Given  the  constructed 
Bayesian  network  and  a  set  of  observations,  we  determine  the  best  explanation 
using  the  most  probable  explanation  (MPE)  inference  (Pearl,  1988).  We  compute 
multiple  alternative  explanations  using  the  k-MPE  algorithm  (Nilsson,  1998)  as 
implemented  in  the  ELVIRA  Elvira-Consortium  (2002)  package. 

Learning  can  be  used  to  automatically  set  the  noisy-or  and  noisy-and  parameters 
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in  the  model.  In  supervised  training  data  for  plan  reeognition,  one  typieally  has 
evidenee  for  the  observed  aetions  and  the  top-level  plans.  However,  we  usually 
do  not  have  evidenee  for  network  nodes  eorresponding  to  subgoals,  noisy-ors, 
and  noisy/logieal-ands.  As  a  result,  there  are  a  number  of  variables  in  the  ground 
networks  whieh  are  always  hidden,  and  henee  EM  is  appropriate  for  learning  the 
requisite  parameters  from  the  partially  observed  training  data.  We  use  the  EM 
algorithm  adapted  for  BLPs  by  Kersting  and  De  Raedt  (2008).  We  simplify  the 
problem  by  learning  only  noisy-or  parameters  and  using  a  deterministie  logieal- 
and  model  to  eombine  evidenee  from  the  eonjunets  in  the  body  of  a  elause. 


4.  Adapting  Markov  Logic 


As  previously  mentioned,  eneoding  the  planning  knowledge  direetly  in  MENs 
does  not  support  abduetive  reasoning.  This  is  beeause  of  the  deduetive  nature 
of  the  rules  eneoding  the  planning  knowledge.  In  MENs,  the  probability  of  a 
possible  world  inereases  with  the  total  weight  of  the  satisfied  formulae.  Sinee 
an  implieation  is  satisfied  whenever  its  eonsequent  is  true,  an  MEN  is  unable  to 
abduetively  infer  the  antecedent  of  a  rule  from  its  consequent.  Given  the  rule 
P  ^  Q  and  the  observation  that  Q  is  true,  we  would  like  to  abduce  P  as  a 
possible  cause  for  Q.  Since  the  consequent  {Q)  is  true,  the  clause  is  trivially 
satisfied,  independent  of  the  value  of  the  antecedent  (P),  and  hence  does  not  give 
any  information  about  the  truth  value  of  the  antecedent  (P). 

In  this  section,  we  describe  three  key  ideas  for  adapting  MENs  with  logical  abduc- 
tive  reasoning,  each  one  building  on  the  previous.  Eirst,  we  describe  the  Pairwise 
Constraint  (PC)  model  proposed  by  Kate  and  Mooney  (2009).  Next,  we  introduce 
the  Hidden  Cause  (HC)  model,  which  alleviates  some  of  the  inefficiencies  of  the 
PC  model.  These  two  models  offer  strategies  for  the  re-encoding  MEN  rules  but 
do  not  change  the  semantics  of  the  traditional  MENs  (Richardson  and  Domingos, 
2006).  Next,  we  introduce  an  abduetive  model  construction  procedure  on  top  of 
the  HC  model  that  results  in  even  simpler  Markov  networks.  This  gives  us  the 
formulation  for  abduetive  Markov  logic.  Our  ground  Markov  network  construc¬ 
tion  strategy  is  different  from  the  one  used  in  traditional  MENs,  and  hence,  our 
formulation  results  in  a  different  semantics. 

We  also  describe  an  alternate  approach  to  plan  recognition  in  which  the  structure 
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of  the  MLN  is  manually  encoded  to  enable  deductive  inference  of  the  top-level 
plans  from  observed  actions.  This  allows  us  to  compare  abductive  Markov  logic 
to  a  manually  encoded  MLN  for  plan  recognition. 


4.1.  Pairwise  Constraint  Model 

Kate  and  Mooney  (2009)  were  the  first  to  develop  an  approach  to  re-encode 
MLNs  with  logical  abductive  reasoning,  which  we  call  the  Pairwise  Constraint 
(PC)  model.  We  describe  this  approach  here  since  it  provides  the  context  for 
understanding  the  more  sophisticated  models  introduced  in  subsequent  sections. 
The  key  idea  is  to  introduce  explicit  reversals  of  the  implications  appearing  in 
the  original  knowledge  base.  Multiple  possible  explanations  for  the  same  ob¬ 
servation  are  supported  by  having  a  disjunction  of  the  potential  explanations  in 
the  reverse  implication.  “Explaining  away”  (Pearl,  1988)  (inferring  one  cause 
eliminates  the  need  for  others)  is  achieved  by  introducing  a  mutual-exclusivity 
constraint  between  every  pair  of  possible  causes  for  an  observation.  Given  the 
set  of  Horn  clauses:  Pi  ^  Q,  P2  ^  Qr  "  Pn  ^  Q,  ^  reverse  implication: 
Q  ^  Pi  V  P2  •  •  ■  V  Pn,  and  a  set  of  mutual-exclusivity  constraints:  Q  ^  -iPi  V 
-'P2,  ■  ■  -  Q  ^  “'Pn-i  V  -iP^  for  all  pairs  of  explanations  are  introduced.  The 
weights  on  these  clauses  control  the  strength  of  the  abductive  inference  and  the 
typical  number  of  alternate  explanations,  respectively.  We  do  not  need  to  ex¬ 
plicitly  model  these  constraints  in  BLPs,  since  the  underlying  model  is  Bayesian 
networks  which  capture  the  full  conditional  probability  distribution  (CPD)  of  each 
node  given  its  parents  and  the  mutual  exclusivity  constraints  are  implicitly  mod¬ 
eled  in  the  conditional  distribution.  For  first-order  Horn  clauses,  all  variables  not 
appearing  in  the  head  of  the  clause  become  existentially  quantified  in  the  reverse 
implication.  We  refer  the  reader  to  Kate  and  Mooney  (2009)  for  the  details  of  the 
conversion  process. 

We  now  illustrate  the  PC  model  with  the  same  example  described  in  Section  3.  It 
is  an  example  from  one  of  our  evaluation  benchmarks,  the  emergency  response  do¬ 
main  introduced  by  Blaylock  and  Allen  (2005)  (all  variables  start  with  uppercase 
and  constants  with  lowercase,  and  by  default  variables  are  universally  quantified): 

hvy_snow(Loc)  A  drive_hzrd(Loc)  ^  blk_rd(Loc) 
acdnt(Loc)  A  clr_wrk(Crew,  Loc)  ^  blk_rd(Loc) 
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These  rules  give  two  explanations  for  a  road  being  bloeked  at  a  location:  1)  there 
has  been  heavy  snow  resulting  in  hazardous  driving  conditions,  and  2)  there  has 
been  an  accident  and  the  crew  is  clearing  the  wreck.  Given  the  observation  that 
a  road  is  blocked,  the  plan  recognition  task  involves  abductively  inferring  one  of 
these  causes  as  the  explanation.  Using  the  PC  model,  we  get  the  final  combined 
reverse  implication  and  pairwise  constraint  clauses  as  below: 

blk_rd(Loc)  (hvy_snow(Loc)  A  drive_hzrd(Loc))V 
(3Crew  acdnt(Loc)  A  clr_wrk(Crew,  Loc)) 
blk_rd(Loc)  ^  -i(hvy_snow(Loc)  A  drive_hzrd(Loc))V 
-i(3Crew  acdnt(Loc)  A  clr_wrk(Crew,  Loc)) 

The  first  rule  introduces  the  two  possible  explanations  and  the  second  rule  con¬ 
strains  them  to  be  mutually  exclusive. 

The  PC  model  can  construct  very  complex  networks  since  it  includes  multiple 
clause  bodies  in  the  reverse  implication,  making  it  very  long.  If  there  are  n  pos¬ 
sible  causes  for  an  observation  and  each  of  the  corresponding  Horn  clause  has  k 
literals  in  its  body,  then  the  reverse  implication  has  0{nk)  literals.  This  in  turn  re¬ 
sults  in  cliques  of  size  0{nk)  in  the  ground  network.  This  significantly  increases 
the  computational  complexity  since  probabilistic  inference  is  exponential  in  the 
treewidth  of  the  graph,  which  in  turn  is  at  least  the  size  of  the  maximum  clique 
(Koller  and  Friedman,  2009).  The  PC  model  also  introduces  0{n^)  pairwise  con¬ 
straints,  which  can  result  in  a  large  number  of  ground  clauses.  As  a  result,  the  PC 
model  does  not  generally  scale  well  to  large  domains. 


4.2.  Hidden  Cause  Model 

The  Hidden  Cause  (HC)  model  alleviates  some  of  the  inefficiencies  of  the  PC 
model  by  introducing  a  hidden  cause  node  for  each  possible  explanation.  The 
joint  constraints  can  then  be  expressed  in  terms  of  these  hidden  causes,  thereby 
reducing  the  size  of  the  reverse  implication  (and  hence,  the  corresponding  clique 
size)  to  0{n).  The  need  for  the  pairwise  constraints  is  eliminated  by  specifying 
a  low  prior  on  all  hidden  causes.  A  low  prior  on  the  hidden  causes  indicates  that 
the  hidden  causes  are  most  likely  to  be  false,  unless  there  is  explicit  evidence 
indicating  the  their  presence.  Hence,  given  an  observation,  inferring  one  cause 
obviates  the  need  for  the  others.  We  now  describe  the  HC  model  more  formally. 
We  first  consider  the  propositional  case  for  the  ease  of  explanation.  It  is  extended 


16 


to  first-order  Horn  clauses  in  a  straightforward  manner.  Consider  the  following 
set  of  rules  describing  the  possible  explanations  for  a  predicate  Q: 

A  Pi2  A  •  •  •  A  Piki  =>  Q,  VI,  (1  <  1  <  n) 

For  each  rule  we  introduce  a  hidden  cause  Q  and  add  the  following  rules  to  the 
MLN: 

1.  ^  Piki  ^  Ci,  VI,  (,l<i<n)  (soft) 

2.  Ci  Q,  Mi,  (1  <i  <n)  (hard) 

3.  Q  ^  CiM  C2  -  ■  ■  Cn  (reverse  implication)  (hard) 

4.  true  ^  Ci,  Mi,  (I  <  i  <  n)  (negatively  weighted)  (soft) 


The  first  set  of  rules  are  soft  clauses  with  high  positive  weights.  This  allows 
the  antecedents  to  sometimes  fail  to  cause  the  consequent  (and  vice-versa).  The 
next  two  sets  of  rules  are  hard  clauses,  they  implement  a  deterministic-or  function 
between  the  consequent  and  the  hidden  causes.  The  last  one  is  a  soft  rule  and 
implements  a  low  prior  (by  having  a  negative  MLN  weight)  on  the  hidden  causes. 
These  low  priors  discourage  inferring  multiple  hidden  causes  for  the  same  conse¬ 
quent  (“explaining  way”),  and  the  strength  of  the  prior  determines  the  degree  to 
which  multiple  explanations  are  allowed. 

Different  sets  of  weights  on  the  biconditional  in  the  first  set  of  rules  implement  dif¬ 
ferent  ways  of  combining  multiple  explanations.  For  example,  a  noisy-or  model 
(Pearl,  1988)  can  be  implemented  by  modeling  the  implication  from  antecedents 
to  the  hidden  cause  as  a  soft  constraint  and  the  reverse  direction  as  a  hard  con¬ 
straint.  The  weight  Wi  for  the  soft-constraint  is  set  to  log[{l  —  Pfi)/Pfi],  where  py. 
is  the  failure  probability  for  cause  i.  This  formulation  is  related  to  previous  work 
on  combining  functions  in  Markov  logic  (Natarajan  et  ah,  2010);  however,  we 
focus  on  the  use  of  such  combining  functions  in  the  context  of  abductive  reason¬ 
ing.  There  has  also  been  prior  work  on  discovering  hidden  structure  in  a  domain 
(e.g.  (Davis  et  ah,  2007;  Kok  and  Domingos,  2007)).  However,  in  our  case,  since 
we  know  the  structure  of  the  hidden  predicates  in  advance,  there  is  no  need  to 
discover  them  using  the  methods  referenced  above. 

We  now  describe  how  to  extend  this  approach  to  first-order  Horn  clauses.  For  first- 
order  Horn  clauses,  variables  present  in  the  antecedents  but  not  in  the  consequent 
become  existentially  quantified  in  the  reverse  implication,  as  in  the  PC  model.  But 
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(tcrew, plaza) 


blk_rd 
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Figure  3:  Ground  network  constructed  by  the  PC  model  for  the  road  blocked  example 


unlike  the  PC  model,  the  reverse  implieation  expression  is  mueh  simpler  as  it  only 
involves  one  predicate  (the  hidden  cause)  for  each  rule  implying  the  consequent. 
Let  us  revisit  the  example  from  the  emergency  response  domain.  Based  on  the 
HC  approach,  we  introduce  two  hidden  causes  (rb_Cl(Loc)  and  rb_C2(Loc))  cor¬ 
responding  to  the  two  (hard)  rules: 

hvy_snow(Loc)  A  drive_hzrd(Loc)  rb_Cl(Loc) 
acdnt(Loc)  A  clr_wrk(Crew,  Loc)  rb_C2(Crew,  Loc) 

Note  that  each  hidden  cause  contains  all  variables  present  in  the  antecedent  of 
the  rule.  These  hidden  causes  are  combined  with  the  original  consequent  using 
the  following  (soft)  rules: 

rb_Cl(Loc)  ^  blk_rd(Loc) 

rb_C2(Crew,  Loc)  ^  blk_rd(Loc) 

blk_rd(Loc)  ^  rb_Cl(Loc)  V  3Crew(rb_C2(Crew,  Loc)) 

In  addition,  there  are  (soft)  unit  clauses  specifying  low  priors  on  the  hidden  causes. 

Figures  3  and  4  show  the  ground  networks  constructed  by  the  PC  and  HC  models 
respectively,  when  loc  is  bound  to  Plaza  and  crew  to  Tcrew.  The  PC  model 
results  in  a  fully  connected  graph  (maximum  clique  size  is  5),  whereas  the  HC 
model  is  much  sparser  (maximum  clique  size  is  3).  Consequently,  inference  in  the 
HC  model  is  significantly  more  efficient. 


18 


hvy_snow 

(plaza) 


drive_hzrd 

(plaza) 


acdnt 

(plaza) 


clr_wrk 

(tcrew,plaza) 


Figure  4:  Ground  network  constructed  by  the  HC  model  for  the  road  blocked  example 

Algorithm  2  presents  the  pseudoeode  for  eonstrueting  the  re-eneoded  MLN  using 
the  HC  approaeh  for  a  given  Horn-elause  knowledge  base.  In  lines  2  to  8,  hid¬ 
den  eauses  are  ereated  for  eaeh  possible  explanation  for  eaeh  eonsequent  (line  5). 
A  bieonditional  is  introdueed  between  the  hidden  eauses  and  the  eorresponding 
anteeedents  (line  6),  whieh  is  modeled  as  a  soft  elause  in  the  MLN.  Eaeh  hidden 
eause  is  also  linked  to  the  eorresponding  eonsequent  via  a  hard  elause  (line  7).  The 
next  part  (lines  9  to  24)  combines  the  hidden  causes  for  each  of  the  consequents 
into  a  single  reverse  implication.  The  rules  are  partitioned  according  to  the  first- 
order  predicate  appearing  in  the  consequent  (line  9).  For  each  partition  (line  10), 
each  possible  instantiation  of  the  consequent  predicate  appearing  in  the  underlying 
rules  is  considered  (lines  11  to  13).  For  instance,  given  the  rule:  hi(X,  Y)  ^  q(Y) 
and  another:  h2(X,  const)  ^  q(const),  we  need  to  consider  each  instantiation  of 
q(X)  and  q(const)  separately.  For  each  such  instantiation  c  (line  14),  we  consider 
the  rules  which  could  result  in  the  consequent  c  being  true  (line  15).  Technically, 
these  are  rules  whose  consequents  subsume  c,  i.e.  there  exists  a  substitution 
such  that  c  =  C{ri)9i,  where  C(rj)  is  the  consequent  of  rule  ri.  These  rules  result 
in  c  when  bound  by  the  substitution  Oi.  For  each  such  rule  g  (line  17),  substitu¬ 
tion  6i  is  applied  to  the  corresponding  hidden  cause  H{ri)  (line  18).  Then,  the 
set  of  free  variables  in  the  hidden  cause  (i.e.  the  variables  not  appearing  in  c) 
is  extracted  (line  19).  These  variables  are  existentially  quantified  in  the  reverse 
implication  (next  step).  We  then  introduce  a  reverse  implication  to  indicate  that  c 
implies  at  least  one  of  the  consequents  (amongst  those  that  subsume  c)  (line  21). 
This  reverse  implication  is  made  a  hard  clause  and  added  to  the  MEN  (line  22). 
Finally,  a  low  prior  is  introduced  for  each  hidden  cause  (lines  25  to  27). 
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Algorithm  2  GenReEncodedMLN(KB) 
inputs:  KB,  background  knowledge  consisting  of  Horn  clauses 
output:  M,  set  of  rules  in  the  re-encoded  MLN 
1:  M  ^  {} 

2:  for  all  r  G  KB  do 
3;  A(r)  ■<—  antecedent  in  r 
4;  C{r)  ^  consequent  in  r 
5:  H (r)  ^  hidden  cause  for  r 

6;  M^MU{A{r)  ^  H{r)} 

7:  M  ^  MU  {H{r)  ^  C{r)} 

8:  end  for 

9:  Part(KB)  partition  of  KB  into  sets  or  rules  with  same 
(first-order)  predicate  in  the  consequent 
10:  for  all  set  of  rules  R  G  Part(KB)  do 
11:  LetR  =  {ri,r2,---rm} 

m 

12:  C{R)  ^ 

i=l 

13:  (C(i?)  is  set  of  unique  consequents  appearing  in  R) 

14:  for  all  c  G  C{R)  do 

15:  Rc  ^  {n  e  R  I  30i,  c  =  C{ri)6i} 

16:  (Re  is  set  of  rules  whose  consequents  subsume  c) 

17:  for  all  r*  G  Rc  do 

18:  Hg^{ri)  ^  H{ri)9i 

19:  {vi^ ,  ,  •  •  •  Ujj, }  ^  variables  appearing  in 

Hoiiri)  but  not  in  c 

20:  end  for 

m 

21.  If  {Rc)  (c  ,  Vi2 )  ■  ■  ■  (rj) ) 

i=l 

22:  M^MU{IriRc)} 

23:  end  for 

24:  end  for 

25:  for  all  r  G  KB  do 

26:  M  M  U  {true  H (r)  (negatively  weighted)  } 

27:  end  for 
28:  return  M 


4.3.  Abductive  Model  Construction 

Abductive  reasoning  using  MLNs  consists  of  the  following  3  steps:  1)  Generate 
the  re-encoded  MLN,  2)  Construct  the  ground  Markov  network  (model  construc- 
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tion),  3)  Perform  learning/inference  over  the  resulting  ground  network.  The  stan¬ 
dard  MLN  model  eonstruetion  proeess  uses  the  set  of  all  possible  ground  atoms 
(the  Herbrand  base)  and  eonstruets  the  set  of  all  possible  ground  elauses  using 
these  atoms.  Using  logieal  rules  to  eonstruet  a  graphieal  model  is  generally  re¬ 
ferred  to  as  knowledge-based  model  construction  (KBMC),  originally  proposed 
by  Wellman  (Wellman  et  ah,  1992;  Breese  et  ah,  1994).  This  idea  was  further  used 
by  Ngo  and  Haddawy  (1997)  for  answering  queries  from  probabilistie  knowledge 
bases.  In  abduetive  reasoning,  we  are  looking  for  explanations  for  a  given  set  of 
observations.  In  this  task,  the  set  of  all  possible  ground  atoms  and  ground  clauses 
may  not  be  needed  to  explain  the  observations.  Considering  the  fully  ground 
network  leads  to  increased  time  and  memory  complexity  for  learning  and  infer¬ 
ence.  For  instance,  in  the  road  blocked  example,  if  the  observation  of  interest  is 
blk_rd(plaza),  then  we  can  ignore  groundings  where  the  location  is  not  plaza. 

We  propose  an  alternative  model-construction  procedure  that  uses  logical  abduc¬ 
tion  to  determine  the  set  of  relevant  ground  atoms.  The  procedure  first  constructs 
the  set  of  abduetive  proof  trees  for  the  observations  and  then  uses  only  the  ground 
atoms  in  these  proofs  instead  of  the  full  Herbrand  base  for  constructing  the  ground 
network.  The  ground  Markov  network  is  then  constructed  by  instantiating  the  for¬ 
mulae  in  the  abduetive  MLN  using  this  reduced  set  of  ground  atoms.  We  refer 
to  the  set  of  ground  atoms  (Markov  network)  thus  constructed  as  the  abduetive 
ground  atoms  (Markov  network).  First,  given  a  set  of  Horn  rules  and  a  set  of  ob¬ 
servations,  the  rules  for  the  abduetive  MLN  are  constructed  using  the  HC  model. 
Next,  the  set  of  most-specific  abduetive  proof  trees  for  the  observations  are  com¬ 
puted  using  the  method  of  Stickel  (1988).  The  abduetive  inference  process  to  get 
the  set  of  most-specific  abduetive  proofs  is  described  in  Algorithm  1 .  The  atoms  in 
these  proofs  form  the  set  of  abduetive  ground  atoms.  For  each  formula  in  the  ab- 
ductive  MLN,  the  set  of  all  ground  formulae  whose  atoms  appear  in  the  abduetive 
ground  atoms  are  added  to  the  ground  Markov  network.  While  handling  existen- 
tials,  only  those  disjuncts  which  belong  to  the  set  of  abduetive  ground  atoms  are 
used.  Learning  and  inference  are  then  performed  over  the  resulting  network. 

In  general,  the  abduetive  model  construction  procedure  results  in  a  ground  net¬ 
work  that  is  substantially  different  (and  usually  much  simpler)  from  that  con¬ 
structed  using  the  full  Herbrand  base.  It  also  differs  from  the  network  constructed 
by  starting  KBMC  from  the  query/observations  (Domingos  and  Lowd,  2009)  be¬ 
cause  of  the  use  of  the  backward  chaining  and  unification  during  the  abduc- 
tive  model  construction.  Consequently,  the  probabilistic  inferences  supported  by 
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this  model  can  be  different  from  that  of  the  traditional  MLN  model.  This  also 
makes  the  abductive  process  different  from  other  pre-processing  approaches  such 
as  Shavlik  and  Natarajan  (2009),  or  existing  lifted  inference  techniques  such  as 
Singla  and  Domingos  (2008),  both  of  which  produce  a  network  that  is  proba¬ 
bilistically  equivalent  to  the  original.  By  focusing  on  the  relevant  ground  atoms, 
abductive  model  construction  significantly  improves  the  performance  of  abductive 
MLNs  both  in  terms  of  time  and  memory  efficiency  as  well  as  predictive  accuracy. 
Further,  lifted  inference  could  still  be  applied  by  constructing  a  lifted  network  over 
the  nodes/clauses  present  in  the  abductive  network.  For  consistency  with  Singla 
and  Mooney  (2011),  we  will  refer  to  abductive  MLNs  (using  the  Hidden  Cause 
model  followed  by  abductive  model  construction)  as  MLN-HCAM. 

Detailed  experiments  demonstrating  significantly  improved  plan-recognition  per¬ 
formance  (with  respect  to  both  accuracy  and  efficiency)  for  the  HC  model  and 
abductive  model  construction  are  presented  by  Singla  and  Mooney  (2011).  There¬ 
fore,  we  only  compare  with  the  MLN-HCAM  approach  in  the  experiments  below. 


4.4.  Plan  Recognition  using  Manually  Encoded  MLNs 

As  discussed  in  the  introduction,  traditional  (non-abductive)  MLNs  can  be  used 
for  plan  recognition  if  an  appropriate  set  of  clauses  are  manually  provided  that 
directly  infer  higher-level  plans  from  primitive  actions,  e.g.  (Sadilek  and  Kautz, 
2010b,a).  In  order  to  compare  to  this  approach,  we  developed  an  MLN  approach 
that  uses  a  manually-engineered  knowledge  base  to  perform  deductive  plan  recog¬ 
nition. 

The  clauses  for  this  manually  encoded  MLN  were  constructed  as  follows.  For 
each  top-level  plan  predicate,  we  identified  any  set  of  observations  that  uniquely 
identifies  this  plan.  This  implies  that  no  other  plan  explains  this  set  of  observa¬ 
tions.  We  then  introduced  a  rule  that  infers  this  plan  given  these  observations  and 
we  make  it  a  hard  clause  in  the  MLN.  If  no  such  observations  exist,  we  introduced 
a  soft  rule  for  each  observation  that  could  potentially  indicate  this  plan.  Hard 
mutual-exclusivity  rules  were  included  for  plans  when  we  know  only  one  of  them 
can  be  true.  Alternatively,  we  gave  a  negative  prior  to  all  plan  predicates  as  de¬ 
scribed  in  the  MLN-HCAM  model.  Note  that  this  approach  does  not  include  any 
predicates  corresponding  to  sub-goals  that  are  never  observed  in  the  data.  As  a 
result,  all  variables  in  this  model  are  fully  observed  during  training,  resulting  in  a 
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less  complex  model  for  learning  and  inference.  This  way  of  encoding  the  domain 
knowledge  avoids  the  automated  machinery  for  introducing  reverse  implications, 
and  can  potentially  lead  to  a  simpler  knowledge  base,  which  results  in  simpler 
ground  networks.  However,  it  requires  a  separate  knowledge-engineering  process 
that  develops  explicit  plan-recognition  rules  for  the  domain,  while  the  abductive 
approach  only  requires  basic  knowledge  about  plans  and  actions  sufficient  for 
planning.  There  are  several  ways  in  which  such  explicit  plan-recognition  knowl¬ 
edge  bases  can  be  manually  engineered,  and  we  have  employed  just  one  possible 
approach.  Exploring  alternative  approaches  is  a  direction  for  future  work.  Sub¬ 
sequently,  we  refer  to  this  manually  encoded  traditional  MEN  model  as  “MEN- 
manual”. 


4.5.  Probabilistic  Modeling,  Inference,  and  Learning 

In  MEN-HC  and  MEN-HCAM,  we  use  the  noisy-or  model  to  combine  multi¬ 
ple  explanations.  In  all  MEN  models,  we  use  the  cutting-plane  method  (Riedel, 
2008)  for  MPE  inference  and  MC-SAT  (Poon  and  Domingos,  2006)  to  compute 
marginal  probabilities  of  plan  instances.  Eor  learning  weights  of  the  clauses  in 
MEN-HCAM,  we  use  a  version  of  gradient-based  voted-perceptron  algorithm 
(Singla  and  Domingos,  2005)  modified  for  partially  observed  data  as  discussed 
in  Chapter  20  (Section  3.3.1)  of  Roller  and  Eriedman  (2009).  Eor  the  traditional 
MEN  model,  gradient-based  voted-perceptron  algorithm  runs  out  of  memory  due 
to  the  size  and  complexity  of  the  resulting  ground  networks.  Therefore,  we  learned 
weights  using  the  discriminative  online  learner  proposed  by  Huynh  and  Mooney 
(2011a).'  More  details  about  various  settings  used  in  our  experiments  are  de¬ 
scribed  in  Section  5. 


5.  Experimental  Evaluation 


In  this  section,  we  present  an  extensive  experimental  evaluation  of  the  perfor¬ 
mance  of  BAEPs  and  MENs  on  three  plan-recognition  data  sets.  Unfortunately, 


'This  is  possible  since  the  training  data  for  this  model  is  fully  observed  (i.e  there  are  no  hidden 
nodes). 
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there  have  been  very  few  rigorous  empirieal  evaluations  of  plan-reeognition  sys¬ 
tems,  and  there  are  no  widely-used  benehmarks.  Our  experiments  employ  three 
extant  data  sets  and  eompare  to  the  state-of-the-art  results  in  order  to  demonstrate 
the  advantages  of  SRL  methods.  In  addition  to  presenting  eonerete  results  for 
BALPs  and  MLNs,  we  also  eonsider  their  relationship  to  other  probabilistie  log- 
ies,  diseussing  the  relative  strengths  and  weaknesses  of  different  SRL  models  for 
plan  reeognition. 


5.1.  Datasets 

5.1.1.  Story  Understanding 

We  first  used  a  small  dataset  previously  used  to  evaluate  abduetive  story  under¬ 
standing  systems  (Ng  and  Mooney,  1992;  Chamiak  and  Goldman,  1991).^  In  this 
task,  eharaeters’  higher-level  plans  must  be  inferred  from  their  aetions  deseribed 
in  a  narrative  text.  A  logieal  representation  of  the  literal  meaning  of  the  text  is 
given  for  eaeh  example.  A  sample  story  (in  English)  is:  “Bill  went  to  the  liquor- 
store.  He  pointed  a  gun  at  the  owner.”  The  plans  in  this  dataset  inelude  shopping, 
robbing,  restaurant  dining,  traveling  in  a  vehiele  (bus,  taxi  or  plane),  partying 
and  jogging.  Most  narratives  involve  more  than  a  single  plan.  This  small  dataset 
eonsists  of  25  development  examples  and  25  test  examples  eaeh  eontaining  an 
average  of  12.6  literals.  We  used  the  planning  baekground  knowledge  initially 
eonstrueted  for  the  ACCEL  system  (Ng  and  Mooney,  1992),  whieh  eontains  a 
total  of  126  Horn  rules. 


5.1.2.  Monroe 

We  also  used  the  Monroe  dataset,  an  artifieially  generated  plan-reeognition  dataset 
in  the  emergeney  response  domain  by  Blayloek  and  Allen  (2005).  This  domain  in- 
eludes  10  top  level  plans  like  setting  up  a  temporary  shelter,  elearing  a  road  wreek, 
and  providing  medieal  attention  to  vietims.  The  task  is  to  infer  a  single  instanti¬ 
ated  top-level  plan  based  on  a  set  of  observed  aetions  automatieally  generated  by 
a  hierarchical  transition  network  (HTN)  planner.  We  used  the  the  logieal  plan 


^Available  at  http :  / / www  .cs.utexas.edu/ users /ml/ accel .  html 
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knowledge  base  for  this  domain  eonstrueted  by  Raghavan  and  Mooney  (2011), 
whieh  eonsists  of  153  Horn  elauses.  We  used  1,000  artifieially  generated  exam¬ 
ples  in  our  experiments.  Eaeh  example  instantiates  one  of  the  10  top-level  plans 
and  eontains  an  average  of  10.19  ground  literals  deseribing  a  sample  exeeution  of 
this  plan. 


5.1.3.  Linux 

The  Linux  dataset  is  another  plan-reeognition  dataset  ereated  by  Blayloek  and 
Allen  (2004).  Human  users  were  asked  to  perform  various  tasks  in  Linux  and  their 
eommands  were  reeorded.  The  task  is  to  prediet  the  eorreet  top  level  plan  from  the 
sequenee  of  exeeuted  eommands.  Lor  example,  one  of  the  tasks  involves  finding 
all  files  with  a  given  extension.  The  dataset  eonsists  of  19  top  level  plan  sehemas 
and  457  examples,  with  an  average  of  6.1  eommand  literals  per  example.  Here 
again,  we  used  the  plan  knowledge  base  eonstrueted  by  Raghavan  and  Mooney 
(201 1),  whieh  eonsists  of  50  Horn  elauses. 

Laeh  of  these  data  sets  evaluates  a  distinet  aspeet  of  plan  reeognition  systems. 
Sinee  the  Monroe  domain  is  quite  large  with  numerous  subgoals  and  entities,  it 
tests  the  ability  of  a  plan-reeognition  system  to  seale  to  large  domains.  On  the 
other  hand,  the  Linux  data  set  is  not  that  large,  but  sinee  the  data  eomes  from  real 
human  users,  it  is  quite  noisy.  There  are  several  sourees  of  noise  ineluding  eases 
in  whieh  users  elaim  that  they  have  sueeessfully  exeeuted  a  top-level  plan  when 
aetually  they  have  not  (Blayloek  and  Allen,  2005).  Therefore,  this  data  set  tests 
the  robustness  of  a  plan-reeognition  system  to  noisy  input.  Monroe  and  Linux 
involve  predieting  a  single  top-level  plan;  however,  in  the  Story  Understanding 
domain,  most  examples  have  multiple  top-level  plans.  Therefore,  this  data  set 
tests  the  ability  of  a  plan-reeognition  system  to  identify  multiple  top-level  plans. 


5.2.  Comparison  ofBALPs,  MLNs,  and  Existing  Approaches 

In  this  seetion,  we  present  experimental  results  eomparing  the  performanee  of 
BALPs  and  MLNs  to  that  of  existing  plan  reeognition  approaehes  like  ACCLL 
(Ng  and  Mooney,  1992)  and  the  system  developed  by  Blayloek  and  Allen  (2005). 
ACCEL  is  a  purely  logieal  abduetive  reasoning  system  that  uses  a  variable  eval¬ 
uation  metrie  to  guide  its  seareh  for  the  best  explanation.  It  ean  use  two  differ- 
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ent  evaluation  metrics:  simplicity,  which  selects  the  explanation  with  the  fewest 
assumptions,  and  coherence,  which  selects  the  explanation  that  maximally  con¬ 
nects  the  input  observations  (Charniak  and  Goldman,  1990).  This  second  metric 
is  specifically  geared  towards  text  interpretation  by  measuring  explanatory  coher¬ 
ence  (Ng  and  Mooney,  1990).  Currently,  this  bias  has  not  been  incorporated  in 
either  BALPs  or  any  of  the  MLN  approaches.  Blaylock  and  Allen’s  system  is 
a  purely  probabilistic  system  that  learns  statistical  n-gram  models  to  separately 
predict  plan  schemas  (i.e.  predicates)  and  their  arguments. 

Section  4  describes  several  variants  of  MLNs  for  plan  recognition.  All  MLN  mod¬ 
els  were  implemented  using  Alchemy  (Kok  et  ah,  2010),  an  open  source  software 
package  for  learning  and  inference  in  Markov  logic.  We  used  the  logical  abduction 
component  developed  for  BALPs  (Algorithm  1)  for  abductive  model  construc¬ 
tion  in  MLNs.  Since  this  abductive  Markov-logic  formulation  (MLN-HCAM) 
performs  better  than  simple  re-encodings  of  the  traditional  MLN,  we  restrict  our 
comparative  experiments  to  this  approach.  For  more  details  on  the  experimental 
results  comparing  the  different  MLN  enhancements,  we  refer  the  reader  to  Singla 
and  Mooney  (2011).  For  the  Monroe  and  Linux  datasets,  we  also  compare  with 
the  traditional  (non-abductive)  MLN  approach  described  in  Section  4.4,  referred 
to  as  “MLN-manual.” 

For  BALPs,  we  learned  the  noisy-or  parameters  using  the  EM  algorithm  described 
in  Section  3.2  whenever  possible.  Similarly  for  both  MLN-HCAM  and  MLN, 
we  learned  the  parameters  using  the  algorithms  described  in  Section  4.5.  For 
datasets  that  had  multiple  top  plans  as  the  explanation,  we  computed  the  most 
probable  explanation  (MPE).  For  datasets  that  had  a  single  correct  top-level  plan, 
we  computed  the  marginal  probability  of  plan  instances  and  picked  the  one  with 
the  highest  marginal  probability.  For  both  MLNs  and  BALPs,  we  have  used  exact 
inference  whenever  feasible.  However,  when  exact  inference  was  intractable,  we 
used  approximate  sampling  to  perform  probabilistic  inference  -  Sample  Search 
(Gogate  and  Dechter,  2011)  for  BALPs  and  MC-SAT  (Poon  and  Domingos,  2006) 
for  MLNs.  Both  these  techniques  are  approximate  sampling  algorithms  designed 
for  graphical  models  with  deterministic  constraints.  Whenever  we  deviate  from 
this  standard  methodology,  we  provide  the  specific  details. 
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5.2.1.  Story  Understanding 


This  section  provides  information  on  the  methodology  and  results  for  experiments 
on  the  story-understanding  data  set. 

Parameter  Learning 

For  BALPs,  we  were  unable  to  learn  effective  parameters  from  the  mere  25  de¬ 
velopment  examples.  As  a  result,  we  set  parameters  manually  in  an  attempt  to 
maximize  performance  on  the  development  set.  A  uniform  value  of  0.9  for  all 
noisy-or  parameters  seemed  to  work  well  for  this  domain.  The  intuition  behind 
our  choice  of  noisy-or  parameters  is  as  follows:  if  a  parent  node  is  true,  then  with 
a  probability  of  0.9,  the  child  node  will  be  true.  In  other  words,  if  a  cause  is  true, 
then  the  effect  is  true  with  the  probability  of  0.9.  Using  the  deterministic  logical- 
and  model  to  combine  evidence  from  conjuncts  in  the  body  of  a  clause  did  not 
yield  high-quality  results.  Using  noisy-and  significantly  improved  the  results;  so 
we  used  noisy-and’s  with  uniform  parameters  of  0.9.  Here  again,  the  intuition  is 
that  if  parent  node  is  false  or  turned  off,  then  the  child  node  would  also  be  false  or 
turned  off  with  a  probability  0.9.  To  disambiguate  between  conflicting  plans,  we 
set  different  priors  for  high  level  plans  to  maximize  performance  on  the  develop¬ 
ment  data.  For  MLN-HCAM,  we  were  able  to  learn  more  effective  weights  from 
the  examples  in  the  development  set  using  the  learning  algorithm  described  in  the 
Section  4.5. 

Probabilistic  Inference 

Since  multiple  plans  are  possible  in  this  domain,  we  computed  the  most  probable 
explanation  (MPE)  to  infer  the  best  set  of  plans.  Since  the  resulting  graphical 
models  for  this  domain  were  not  exceptionally  large,  we  were  able  to  apply  exact 
(rather  than  approximate)  inference  algorithms.  For  BALPs,  we  used  the  k-MPE 
algorithm  (Nilsson,  1998)  as  implemented  in  Elvira  (Elvira-Consortium,  2002). 
For  MLN-HCAM,  we  used  with  the  cutting-plane  method  (and  the  associated 
code)  developed  by  Riedel  (2008). 

Evaluation  Metrics 

We  compared  BALPs  and  MLN-HCAM  with  ACCEL- Simplicity  and  ACCEL- 
Coherence.  We  compared  the  inferred  plans  with  the  ground  truth  to  compute 
precision  (the  percentage  of  the  inferred  plans  that  are  correct),  recall  (the  per¬ 
centage  of  the  correct  plans  that  were  properly  inferred),  and  F -measure  (the  har- 
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Precision 

Recall 

E-measure 

ACCEL-Simplicity 

66.45 

52.32 

58.54 

ACCEL-Coherence 

89.39 

89.39 

89.39 

BALP 

72.07 

85.57 

78.24 

MLN-HCAM 

69.13 

75.32 

72.10 

Table  1:  Results  for  the  Story  Understanding  dataset 

monic  mean  of  precision  and  recall).  Partial  credit  was  given  for  predicting  the 
correct  plan  predicate  with  some  incorrect  arguments.  A  point  was  rewarded  for 
inferring  the  correct  plan  predicate,  then,  given  the  correct  predicate,  an  additional 
point  was  rewarded  for  each  correct  argument.  For  example,  if  the  correct  plan 
was  plani{ai,  02)  and  the  inferred  plan  was  p/ani(ai,  03),  the  score  is  66.67%. 

Results 

Table  1  shows  the  results  for  Story  Understanding.  Both  BALPs  and  MLN- 
HCAM  perform  better  than  ACCEL-Simplicity,  demonstrating  the  advantage  of 
SRL  models  over  standard  logical  abduction.  BALPs  perform  better  than  MLN- 
HCAM,  demonstrating  an  advantage  of  a  directed  model  for  this  task.  However, 
ACCEL-Coherence  still  gives  the  best  results.  Since  the  coherence  metric  incor¬ 
porates  extra  criteria  specific  to  story  understanding,  this  bias  would  need  to  be 
included  in  the  probabilistic  models  to  make  them  more  competitive.  Incorporat¬ 
ing  this  bias  into  SRL  models  is  difficult  since  it  uses  the  graphical  structure  of  the 
abductive  proof  to  evaluate  an  explanation,  which  is  not  straightforward  to  include 
in  a  probabilistic  model.  It  should  be  noted  that  the  coherence  metric  is  specific  to 
narrative  interpretation,  since  it  assumes  the  observations  are  connected  together 
into  a  coherent  “story,”  and  this  assumption  is  not  generally  applicable  to  other 
plan  recognition  problems. 


5.2.2.  Monroe  and  Linux 

This  section  provides  information  on  the  methodology  and  results  for  experiments 
on  the  Monroe  and  Linux  data  sets. 

Parameter  Learning 

For  BALPs,  we  learned  the  noisy-or  parameters  automatically  from  data  using  the 
EM  algorithm  described  in  Section  3.2.  We  initially  set  all  noisy-or  parameters 
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to  0.9,  which  gave  reasonable  performanee  in  both  domains.  We  pieked  a  default 
value  of  0.9  based  on  the  intuition  that  if  a  parent  node  is  true,  then  the  ehild  node 
is  true  with  a  probability  0.9.  We  then  ran  EM  with  two  starting  points  -  random 
weights  and  manual  weights  (0.9).  We  found  that  EM  initialized  with  manual 
weights  generally  performed  the  best  for  both  domains,  and  henee  we  used  this 
approaeh  for  our  eomparisons.  Even  though  EM  is  sensitive  to  its  starting  point, 
it  outperformed  other  approaches  even  when  initialized  with  random  weights.  Eor 
Monroe  and  Einux,  initial  experiments  found  no  advantage  to  using  noisy-and 
instead  of  logieal-and  in  these  domains,  so  we  did  not  experiment  with  learning 
noisy-and  parameters. 

Eor  MEN-HCAM,  we  learned  the  weights  automatieally  from  data  using  the  meth¬ 
ods  deseribed  in  Seetion  4.5.  Eor  MEN-manual,  the  online  weight  learner  did  not 
provide  any  improvement  over  default  manually-eneoded  weights  (a  weight  of  1 
for  all  the  soft  elauses  and  a  weight  of  -0.5  on  unit  elauses  for  all  plan  predieates 
to  speeify  a  small  prior  for  all  plans).  Therefore,  we  report  results  obtained  using 
these  manually-eneoded  weights. 

Eor  Monroe,  of  the  1,000  examples  in  our  dataset  we  used  the  first  300  for  training, 
the  next  200  for  validation,  and  the  remaining  500  examples  for  test.  Blayloek  and 
Allen  learned  their  parameters  on  4,500  artifieially  generated  examples.  However, 
we  found  that  using  a  large  number  of  examples  resulted  in  mueh  longer  training 
times  and  that  300  examples  were  suffieient  to  learn  effeetive  parameters  for  both 
BALPs  and  MEN-HCAM.  Eor  BALPs,  we  ran  EM  on  the  training  set  until  we 
saw  no  further  improvement  in  the  performanee  on  the  validation  set.  Eor  MEN- 
HCAM,  the  parameter  learner  was  limited  to  training  on  at  most  100  examples, 
as  learning  on  larger  amounts  of  data  ran  out  of  memory.  Therefore,  we  trained 
MEN-HCAM  on  3  disjoint  subsets  of  the  training  data  and  pieked  the  best  model 
using  the  validation  set. 

Eor  Einux,  we  performed  10-fold  eross  validation.  Eor  BALPs,  we  ran  EM  until 
eonvergenee  on  the  training  set  for  each  fold.  Eor  MEN-HCAM,  within  each  train¬ 
ing  fold,  we  learned  the  parameters  on  disjoint  subsets  of  data,  eaeh  eonsisting  of 
around  110  examples.  As  mentioned  before  for  Monroe,  the  parameter  learner  did 
not  seale  to  larger  data  sets.  We  then  used  the  rest  of  the  examples  in  eaeh  training 
fold  for  validation,  pieking  the  model  that  best  performed  on  the  validation  set. 

As  diseussed  in  seetion  4.4,  in  the  traditional  MEN  model,  there  are  two  ways  to 
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encode  the  bias  that  only  a  single  plan  is  needed  to  explain  a  given  action.  The 
first  approach  is  to  include  explicit  hard  mutual-exclusivity  constraints  between 
competing  plans,  the  second  approach  involves  setting  a  low  prior  on  all  plan 
predicates.  While  the  former  performed  better  on  Monroe,  the  latter  gave  better 
results  on  Linux.  We  report  the  results  of  the  best  approach  for  each  domain. 

As  noted,  we  had  to  adopt  a  slightly  different  training  methodology  for  BALPs 
and  MLN-HCAM  due  to  computational  limitations  of  MLN  weight  learning  on 
large  datasets.  However,  we  used  the  exact  same  test  sets  for  both  systems  on  all 
datasets.  Developing  more  scalable  online  methods  for  training  MLNs  on  partially 
observable  data,  is  an  important  direction  for  future  work. 

Probabilistic  Inference 

Both  Monroe  and  Linux  involve  inferring  a  single  top-level  plan.  Therefore,  we 
computed  the  marginal  probability  of  each  plan  instantiation  and  picked  the  most 
probable  one.  For  BALPs,  since  exact  inference  was  tractable  on  Linux,  we  used 
the  exact  inference  algorithm  implemented  in  Netica,^  a  commercial  Bayes-net 
software  package.  For  Monroe,  the  complexity  of  the  ground  network  made  ex¬ 
act  inference  intractable  and  we  used  SampleSearch  (Gogate  and  Dechter,  2011), 
an  approximate  sampling  algorithm  for  graphical  models  with  deterministic  con¬ 
straints.  For  both  MLN  approaches,  we  used  MC-SAT  (Poon  and  Domingos, 
2006)  as  implemented  in  the  Alchemy  system  on  both  Monroe  and  Linux. 

Evaluation  Metric 

We  compared  the  performance  of  BALPs,  MLN-HCAM  and  MLN  with  Blaylock 
and  Allen’s  system  using  the  convergence  score  as  defined  by  Blaylock  and  Allen 
(2005).  The  convergence  score  measures  the  fraction  of  examples  for  which  the 
correct  plan  predicate  is  inferred  (ignoring  the  arguments)  when  given  all  of  the 
observations.  Use  of  the  convergence  score  allowed  for  the  fairest  comparison  to 
the  original  results  on  these  datasets  published  by  Blaylock  and  Allen.^ 

Results 

Table  2  shows  convergence  score  for  both  the  Monroe  and  Linux  datasets.  Both 


^http : / / WWW . norsys . com/ 

"^Blaylock  and  Allen  also  report  results  on  predicting  arguments,  but  using  a  methodology  that 
makes  a  direct  comparison  difficult. 
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Monroe 

Linux 

Blaylock 

94.20 

36.10 

MLN-manual 

90.80 

16.19 

MLN-HCAM 

97.00 

38.94 

BALP 

98.40 

46.60 

Table  2:  Convergence  scores  for  Monroe  and  Linux  datasets 

BALPs  and  MLN-HCAM  recognize  plans  in  these  domains  more  accurately  than 
Blaylock  and  Allen’s  system.  The  performance  of  BALP  and  MLN-HCAM  are 
fairly  comparable  on  Monroe;  however  BALPs  are  significantly  more  accurate  on 
Linux.  The  traditional  MLN  performs  the  worst,  and  does  particularly  poorly  on 
Linux.  This  demonstrates  the  value  of  the  abductive  approach  as  implemented  in 
MLN-HCAM. 

Partial  Observability  Results 


The  convergence  score  has  the  following  limitations  as  a  metric  for  evaluating  the 
performance  of  plan  recognition: 

1 .  It  only  accounts  for  predicting  the  correct  plan  predicate,  ignoring  the  ar¬ 
guments.  In  most  domains,  it  is  important  for  a  plan-recognition  system  to 
accurately  predict  arguments  as  well.  For  example,  in  the  Linux  domain,  if 
the  user  is  trying  to  move  “testl.txt”  to  “test-dir”,  it  is  not  sufficient  to  just 
predict  the  move  command;  it  is  also  important  to  predict  the  file  (test.txt) 
and  the  destination  directory  (test-dir). 

2.  It  only  evaluates  plan  prediction  after  the  system  has  observed  all  of  the 
executed  actions.  However,  in  most  cases,  we  would  like  to  be  able  to 
predict  plans  after  observing  as  few  actions  as  possible. 


In  order  to  evaluate  the  ability  of  BALPs  and  MLNs  to  infer  plan  arguments  and 
to  predict  plans  after  observing  only  a  partial  execution,  we  conducted  an  addi¬ 
tional  set  of  experiments.  Specifically,  we  performed  plan  recognition  after  ob¬ 
serving  the  first  25%,  50%,  75%,  and  100%  of  the  executed  actions.  To  measure 
performance,  we  compared  the  complete  inferred  plan  (with  arguments)  to  the 
gold-standard  to  compute  an  overall  accuracy  score.  As  for  Story  Understanding, 
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25% 

50% 

75% 

100% 

MEN-manual 

10.63 

13.23 

37.00 

67.13 

MEN-HCAM 

15.93 

19.93 

43.93 

76.30 

BAEP 

07.33 

20.26 

44.63 

79.16 

Table  3:  Accuracy  on  Monroe  data  for  varying  levels  of  observability 


25% 

50% 

75% 

100% 

MEN-manual 

10.61 

10.72 

10.61 

10.61 

MEN-HCAM 

16.30 

16.48 

24.36 

28.84 

BAEPs 

19.83 

25.45 

34.06 

36.32 

Table  4:  Accuracy  on  Linux  data  for  varying  levels  of  observability 

partial  credit  was  given  for  predieting  the  eorreet  plan  predicate  with  only  a  subset 
of  its  eorreet  arguments. 

Table  3  shows  the  results  for  partial  observability  for  the  Monroe  data.  BALPs  per¬ 
forms  slightly  better  than  MLN-HCAM  on  higher  levels  of  observability,  whereas, 
MLN-HCAM  tends  to  outperform  BALP  on  lower  levels  of  observability.  The 
MLN-manual  performs  worst  at  higher  levels  of  observability,  but  at  25%  ob¬ 
servability,  it  out-performs  BALPs.  Table  4  shows  the  results  for  partial  observ¬ 
ability  on  the  Linux  data.  Here,  BALPs  clearly  outperform  MLN-HCAM  and 
traditional  MLNs  at  all  the  levels  of  observability.  The  traditional  MLN  performs 
signifieantly  worse  than  the  other  two  models,  espeeially  at  higher  levels  of  ob¬ 
servability.  For  Story  Understanding,  sinee  the  set  of  observed  aetions  is  already 
ineomplete,  we  did  not  perform  additional  experiments  for  partial  observability. 


5.2.3.  Discussion 

We  believe  several  aspeets  of  SRL  approaches  led  to  their  superior  performance 
over  existing  approaches  like  ACCEL  and  Blaylock  and  Allen’s  system.  The  abil¬ 
ity  of  BALPs  and  MLN-HCAM  to  perform  probabilistie  reasoning  most  likely 
resulted  in  their  improved  performanee  over  ACCEL-Simplicity,  a  standard  logi- 
eal  abduetion  method.  When  Blayloek  and  Allen  (2005)  perform  instantiated  plan 
reeognition,  it  is  done  in  a  pipeline  of  two  separate  steps.  The  first  step  predicts 
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the  plan  schema  and  the  second  step  predicts  the  arguments  given  the  schema. 
Unlike  their  approach,  BALPs  and  MLN-HCAM  are  able  to  jointly  predict  both 
the  plan  schema  and  its  arguments  simultaneously.  We  believe  that  this  ability  of 
these  SRL  models  to  perform  joint  prediction  of  plans  and  their  arguments  is  at 
least  partially  responsible  for  their  superior  performance.  In  addition,  both  BALPs 
and  MLN-HCAM  utilize  prior  planning  knowledge  encoded  in  the  logical  clauses 
given  to  the  system,  while  Blaylock  and  Allen’s  system  has  no  access  to  such 
planning  knowledge.  We  believe  that  the  ability  of  SRL  models  to  incorporate 
such  prior  domain  knowledge  also  contributes  to  their  superior  performance. 

MLN-manual,  although  a  joint  model,  cannot  take  advantage  of  all  of  the  domain 
knowledge  in  the  planning  KB  available  to  BALPs  and  MLN-HCAM.  Also,  it 
does  not  have  the  advantages  offered  by  the  abductive  model  construction  pro¬ 
cess  used  in  MLN-HCAM.  This  also  makes  it  difficult  to  adequately  learn  the 
parameters  for  this  model.  We  believe  these  factors  lead  to  its  overall  inferior 
performance  compared  to  the  other  models.  For  both  the  Monroe  and  Linux  do¬ 
mains,  the  relative  gap  in  the  performance  of  MLN-manual  model  decreases  with 
decreasing  observability.  This  is  particularly  evident  in  Linux,  where  the  perfor¬ 
mance  stays  almost  constant  with  decreasing  observability.  We  believe  this  is  due 
to  the  model’s  ability  to  capitalize  on  even  a  small  amount  of  information  that 
deterministically  predicts  the  top-level  plan.  Furthermore,  at  lower  levels  of  ob¬ 
servability,  the  ground  networks  are  smaller  and  therefore,  approximate  inference 
is  more  likely  to  be  accurate.  Singla  and  Mooney  (201 1)  report  that  MLN-PC  and 
MLN-HC  models  did  not  scale  well  enough  to  make  them  tractable  for  the  Monroe 
and  Linux  domains.  When  compared  to  these  models,  MLN-manual  has  a  sub¬ 
stantial  advantage.  But  it  still  does  not  perform  nearly  as  well  as  the  MLN-HCAM 
model.  This  re-emphasizes  the  importance  of  using  a  model  that  is  constructed  by 
focusing  on  both  the  query  and  the  available  evidence.  Furthermore,  the  MLN- 
manual  approach  requires  costly  human  labor  to  engineer  the  knowledge  base. 
This  is  in  contrast  to  the  abductive  MLN  models  that  allow  the  same  knowledge 
base  to  be  used  for  both  planning  and  plan  recognition. 

Comparing  BALPs  and  MLN-HCAM,  BALPs  generally  performed  better.  We  be¬ 
lieve  this  difference  is  partly  due  to  the  advantages  that  directed  graphical  models 
have  for  abductive  reasoning,  as  originally  discussed  by  Pearl  (1988).  Note  that 
MLN-HCAM  already  incorporates  several  ideas  that  originated  with  BALPs.  By 
using  hidden  causes  and  noisy-or  to  combine  evidence  from  multiple  rules,  and 
by  utilizing  logical  abduction  to  obtain  a  focused  set  of  literals  for  the  ground  net- 
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work,  we  improved  the  performanee  of  MLNs  by  making  them  produee  a  graph- 
ieal  model  that  is  more  similar  to  the  one  produced  by  BALPs.  Although,  in 
principle,  any  directed  model  can  be  re-expressed  as  an  undirected  model,  the 
learning  of  parameters  in  the  corresponding  undirected  model  can  be  significantly 
more  complex  since  there  is  no  closed  form  solution  for  the  maximum-likelihood 
set  of  parameters  unlike  in  the  case  of  directed  models  ^  (Koller  and  Friedman, 
2009).  Inaccurate  learning  of  parameters  can  lead  to  potential  loss  of  accuracy 
during  final  classification.  Undirected  models  do  have  the  advantage  of  repre¬ 
senting  cyclic  dependencies  (such  as  transitivity),  which  directed  models  can’t 
represent  explicitly  because  of  the  acyclicity  constraint.  But  we  did  not  find  it 
particularly  useful  for  plan  recognition  since  the  domain  knowledge  is  expressed 
using  rules  that  have  an  inherent  directional  (causal)  semantics.  In  addition,  it  is 
very  difficult  to  develop  a  general  MLN  method  that  constructs  a  Markov  net  that 
is  formally  equivalent  to  the  Bayes  net  constructed  by  a  BALP  given  the  same 
initial  planning  knowledge  base. 

In  general,  it  took  much  more  engineering  time  and  effort  to  get  MLNs  to  perform 
well  on  plan  recognition  compared  to  BLPs.  Extending  BLPs  with  logical  ab¬ 
duction  was  straightforward.  The  main  problem  we  encountered  while  adapting 
BLPs  to  work  well  on  our  plan  recognition  benchmarks  was  finding  an  effec¬ 
tive  approximate  inference  algorithm  that  scaled  well  to  the  larger  Monroe  and 
Linux  datasets.  Once  we  switched  to  Sample  Search,  which  is  designed  to  work 
well  with  the  mix  of  soft  and  hard  constraints  present  in  our  networks,  BALPs 
produced  good  results.  However,  getting  competitive  results  with  MLNs  and  scal¬ 
ing  them  to  work  with  our  larger  datasets  was  significantly  more  difficult.  First, 
we  needed  to  develop  a  method  for  properly  introducing  reverse  clauses  to  allow 
MLNs  to  perform  logical  abduction.  Next,  we  had  to  develop  a  method  for  in¬ 
troducing  hidden  causes  in  order  to  prevent  the  creation  of  networks  with  large 
cliques  that  made  inference  intractable.  Finally,  we  had  to  develop  an  abductive 
model  construction  process  that  used  the  ground  literals  constructed  for  BALPs 
to  constrain  the  size  and  scope  of  the  ground  Markov  net.  Even  after  all  these 
modifications,  the  weight-learning  algorithm  did  not  scale  to  larger  training  sets, 
and  the  overall  results  are  still  not  competitive  with  BALPs. 


^In  directed  models,  a  closed  form  solution  exists  for  the  case  of  full  observability.  This  corre¬ 
sponds  to  the  M  step  in  EM  when  dealing  with  partial  observability. 
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Some  of  the  differences  in  the  performance  of  BALPs  and  MLN-HCAM  may  also 
stem  from  the  differences  in  the  probabilistic  inference  and  parameter-learning 
algorithms  employed.  For  instance,  on  the  Linux  dataset,  we  could  run  exact 
inference  for  BALPs,  but  we  had  to  resort  to  MC-SAT,  an  approximate  sam¬ 
pling  algorithm,  for  MLN-HCAM.  On  Monroe,  even  though  we  used  approximate 
sampling  algorithms  for  both  BALPs  and  MLN-HCAM,  it  is  unclear  whether 
the  performance  of  Sample  Search  and  MC-SAT  are  comparable.  Furthermore, 
since  probabilistic  inference  is  used  extensively  during  parameter  learning,  per¬ 
formance  of  the  inference  techniques  could  impact  the  quality  of  the  learned 
weights/parameters.  In  our  preliminary  work,  we  converted  the  noisy-or  parame¬ 
ters  learned  using  the  EM  algorithm  for  BALPs  into  weights  in  MLN-HCAM. 
When  we  performed  plan  recognition  using  these  weights,  we  found  that  the 
performance  of  MLN-HCAM  improved,  demonstrating  a  lack  of  quality  in  the 
learned  MLN  weights.  This  could  be  due  either  to  poor  performance  of  proba¬ 
bilistic  inference,  or  to  poor  performance  of  the  weight  learner  itself.  Additional 
experiments  that  control  for  changes  in  the  inference  and  learning  algorithms  are 
needed  to  further  understand  the  effects  of  these  differences. 


5.3.  Comparison  of  BALPs  and  MLNs  to  other  SRL  models 

BLPs,  BALPs,  and  MLNs  are  all  languages  for  flexibly  and  compactly  repre¬ 
senting  large,  complex  probabilistic  graphical  models.  An  alternative  approach 
to  SRL  is  to  add  a  stochastic  element  to  the  deductive  process  of  a  logic  pro¬ 
gram.  ProbLog  (Kimmig  et  ah,  2008),  is  the  most  recent  and  well-developed  of 
these  approaches.  ProbLog  can  be  seen  as  extending  and  subsuming  several  pre¬ 
vious  models,  such  as  Poole’s  Probabilistic  Horn  Abduction  (PHA)  (Poole,  1993) 
and  PRISM  (Sato,  1995).  Finally,  there  is  publicly-available  implementation  of 
ProbLog  ®  that  exploits  the  latest  inference  techniques  based  on  binary  decision 
diagrams  (BDDs)  to  provide  scalability  and  efficiency. 

Therefore,  we  attempted  to  also  compare  the  performance  of  our  models  to  ProbLog 
It  was  relatively  straightforward  to  develop  a  ProbLog  program  for  plan-recognition 
by  appropriately  formulating  the  planning  KB  used  for  both  BLPs  and  abductive 
MLNs.  However,  our  preliminary  explorations  with  ProbLog  revealed  a  serious 
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limitation  that  prevented  us  from  aetually  performing  an  experimental  eomparison 
on  our  plan  reeognition  datasets.  In  a  number  of  the  planning  axioms  in  our  KBs, 
existentially  quantified  variables  oeeur  in  the  body  of  a  elause  whieh  do  not  oeeur 
in  the  head.  Representing  these  elauses  in  ProbLog  requires  binding  sueh  vari¬ 
ables  to  all  possible  type-eonsistent  eonstants  in  the  domain.  However,  this  results 
in  the  ProbLog  inferenee  engine  attempting  to  eonstruet  an  intraetable  number  of 
explanations  (i.e.  proofs)  due  to  the  eombinatorial  number  of  possible  eombina- 
tions  of  these  introdueed  eonstants.  Therefore,  it  was  intraetable  to  run  ProbLog 
on  our  datasets,  preventing  an  empirieal  eomparison.  BALPs  and  MLN-HCAM 
use  a  greedy  abduetive-proof  eonstruetion  method  deseribed  in  seetion  3.1  to  pre¬ 
vent  this  eombinatorial  explosion.  Therefore,  we  believe  ProbLog  would  need  a 
new  approximate  inferenee  algorithm  for  this  situation  in  order  to  be  praetieally 
useful  for  plan  reeognition. 

Abduetive  Stoehastie  Logie  Programs  (ASLPs)  (Chen  et  ah,  2008)  are  another 
SRL  model  that  uses  stoehastie  deduetion  and  supports  logieal  abduetion  and, 
therefore,  eould  potentially  be  applied  to  plan  reeognition.  However,  we  are  un¬ 
aware  of  a  publiely- available  implementation  of  ASLPs  that  eould  be  easily  used 
for  experimental  eomparisons. 


6.  Future  Work 


The  researeh  presented  in  this  paper  eould  be  extended  in  various  ways.  First,  it 
would  be  good  to  evaluate  the  proposed  plan-reeognition  systems  on  additional 
domains  and  applieations.  Unfortunately,  there  are  very  few  publiely-available 
datasets  for  plan  reeognition. 

Seeond,  the  existing  SRL  methods  eould  be  improved  and  extended  in  several 
produetive  direetions.  Methods  for  lifted  inferenee  (Singla  and  Domingos,  2008) 
eould  improve  effieieney  by  allowing  probabilistie  inferenee  to  be  performed  with¬ 
out  having  to  explieitly  eonstruet  eomplete  ground  networks.  In  partieular,  the 
latest  Probabilistie  Theorem  Proving  (PTP)  methods  for  lifted  inferenee  in  MLNs 
(Gogate  and  Domingos,  2011)  eould  be  tried  to  improve  the  effieieney  and  aeeu- 
raey  of  the  MLN  models. 

Improved  on-line  weight  learning  algorithms  eould  be  developed  to  more  effi- 
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ciently  train  on  large  datasets  and  inerease  the  aeeuraey  of  the  learned  models.  In 
partieular,  diseriminative  rather  than  generative  (i.e.  EM)  parameter  learning  for 
BALPs  should  be  explored.  Although  diseriminative  learning  is  more  diffieult  for 
direeted  graphieal  models  than  for  undireeted  ones,  there  has  been  reeent  progress 
on  this  problem  (Carvalho  et  ah,  2011).  Current  diseriminative  on-line  weight 
learners  for  MLNs  (Huynh  and  Mooney,  2011a)  assume  eompletely  observable 
training  data.  These  methods  are  not  applieable  to  abduetive  MLNs,  whieh  eon- 
tain  unobserved  sub-goal  and  noisy-or  nodes.  Therefore,  effeetive  on-line  meth¬ 
ods  for  partially-observed  training  data  need  to  be  developed. 

With  respeet  to  the  traditional  MEN  approaeh,  better  methods  for  manually  en¬ 
gineering  effeetive  rules  for  deduetive  plan  reeognition  eould  be  developed.  Al¬ 
ternatively,  MEN  strueture  learning  (Kok  and  Domingos,  2005,  2010;  Huynh  and 
Mooney,  201  lb)  eould  be  used  to  automatieally  induee  sueh  rules  from  supervised 
training  data.  In  addition,  a  similar  approaeh  eould  be  developed  for  applying  tra¬ 
ditional  (deduetive)  BLPs  to  plan  reeognition. 

The  eurrent  experimental  eomparisons  should  be  extended  to  additional  SRL  mod¬ 
els.  As  mentioned  in  seetion  5.3,  an  improved  approximate  inferenee  method  is 
needed  to  make  ProbLog  traetable  for  our  plan-reeognition  problems.  Compar¬ 
isons  to  other  SRL  models  sueh  as  Poole’s  Horn  Abduetion  (Poole,  1993),  PRISM 
(Sato,  1995),  and  Abduetive  Stoehastie  Logie  Programs  (Chen  et  ah,  2008),  are 
also  indieated. 


7.  Conclusions 


This  paper  has  introdueed  two  new  SRL  approaches  to  plan  recognition,  one 
based  on  Bayesian  Logic  Programs  (BLPs),  the  other  on  Markov  Logic  Networks 
(MLNs).  Both  of  these  approaches  combine  the  advantages  of  prior  logical  and 
probabilistic  methods.  We  presented  novel  techniques  for  extending  both  MLNs 
and  BLPs  with  logical  abduction  in  order  to  allow  for  plan  recognition  given  log¬ 
ical  definitions  of  actions  and  plans  as  the  only  prior  knowledge.  Experimental 
evaluations  on  three  benchmark  data  sets  have  shown  that  our  approaches  gen¬ 
erally  outperform  other  state-of-the-art  methods  in  plan  recognition.  We  believe 
their  superior  performance  is  due  to  the  combination  of  logical  abduction,  joint 
probabilistic  inference,  and  incorporation  of  planning  domain  knowledge.  The 
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results  also  indicate  that  the  approach  based  on  BLPs  is  generally  more  effective 
than  the  one  based  on  MLNs. 
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